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C^l . Abstract 
> 

I We derive the capacity region of the state-dependent semideterministic broadcast channel with non- 

^ . causal state-information at the transmitter. In this broadcast channel one of the outputs is a deterministic 

function of the channel input and the channel state, and the state is assumed to be known noncausally 
to the transmitter but not to the receivers. 



C/3 



I. Introduction 



I We study the capacity region of the discrete, memoryless, state-dependent, semideterministic 

^ ■ broadcast channel. Such a channel has a single transmitting node, two receiving nodes, and an 
internal state, all of which are assumed to take value in finite sets. One of the receiving nodes 
observes a symbol Y that is a deterministic function of the transmitted symbol x and the state S 

Y = f{x,S) with probability one, (la) 

X' 

■ and the other receiving node observes a symbol Z which is random: conditional on the input 
being x and the state being s, the probability that it be z is iy(2;|x, s) 

Pr{Z = z\X = x,S = s) = W{z\x,s). (lb) 

The state sequence S is assumed to be independent and identically distributed (IID) according 
to some law Psi ) 

Fr{S = s) = Ps{s) (Ic) 
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and to be revealed to the encoder in a noncausal way: all future values of the state are revealed 
to the transmitter before transmission begins. 

The definition of the capacity region of this channel is analogous to that of a broadcast channel 
without a state [1], [2]. The main result of this paper is a single-letter characterization of the 
capacity region: 

Theorem 1: The capacity region of the channel (1) when the states are known noncausally to 
the transmitter is the convex closure of the union of rate-pairs (i?i, i?2) satisfying 

Ri<H{Y\S) (2a) 
i?2 < /([/; Z) - I{U; S) (2b) 
Ri + R2< H{Y\S) + I{U; Z) - I{U; S, Y) (2c) 

over all joint distribution on (X, Y, Z, S, U) whose marginal Ps is the given state distribution 
and under which, conditional on X and S, the channel outputs Y and Z are drawn according 
to the channel law (1) independently of U: 

PxYzsu{x,y,z,s,u) = Ps{s)Pxu\s{x,u\s)l{y = f{x,s)}W{z\x,s). (3) 

Here 1{ } denotes the indicator function.^ Moreover, the capacity region remains the same even 
if the state sequence is revealed to the deterministic receiver, i.e., if we replace /(•, •) with the 
mapping {x,s) ^ {f{x,s),s). 

State-dependent broadcast channels were considered before [3], [4]. Steinberg [3] studied the 
degraded state-dependent broadcast channel with causal and with noncausal side-information at 
the transmitter. He derived the capacity region for the causal case, but for the noncausal case 
his outer and inner bounds do not coincide. Steinberg and Shamai [4] then derived an inner 
bound for general (not necessarily degraded) state-dependent broadcast channels with noncausal 
side-information. This inner bound is based on Marton's inner bound for broadcast channels 
without states [5] and on Gel'fand-Pinsker coding [6]. In fact, the direct part of our Theorem 1 
can be deduced from [4] with a proper choice of the auxiliary random variables (see Section 
III- A). However, capacity regions of most state-dependent broadcast channels are still unknown. 

'The value of l{statement} is 1 if the statement is true and is otherwise. 
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Much work has been done on broadcast channels without states [7]. Our work can be consid- 
ered as an extension of previous works on deterministic broadcast channels (solved by Gel'fand, 
Marten and Pinsker [8]-[10]) and on semideterministic broadcast channels (solved by Gel'fand 
and Pinsker [11]). These results can be found in [2]. 

hi the rest of this paper we discuss some special cases of Theorem 1 (Section II) and then 
prove the direct and converse parts of Theorem 1 in Sections HI and IV. 

II. Simple Special Cases 

In this section we discuss two special cases of Theorem 1 with self-contained proofs which 
are simpler than the proof of Theorem 1. The first special case is a single-user deterministic 
channel described by (la), where the states are known noncausally to the transmitter. In this 
case. Theorem 1 reduces to the following: 

Theorem 2: The capacity of the single-user deterministic channel (la) when the states are 
known noncausally to the transmitter is 

C = maxi/(y|5) (4) 

Px\s 

= Y,Ps{s)\og\{y:3x, y = (5) 

s 

where the conditional entropy on the right-hand side (RHS) of (4) is computed for the joint distri- 
bution PxYsi'^, V: s) — Ps{s)Px\s{x\s)l{y — f{x, s)}. Furthermore, the capacity is unchanged 
when the state sequence is also revealed to the receiver. 

Proof: To prove the direct part, we use the formula of Gel'fand and Pinsker [6]: 

C= max I(U;Y) - I(U;S), (6) 

U^-{X,S)^-Y 

where we choose the auxiliary random variable U to be Y. Note that because F is a deterministic 
function of X and S, this choice is valid in the sense that [/— o— (X, S)—o—Y forms a Markov 
chain. Consequently, 

C>I{Y-Y)-I{Y-S) (7) 
^H{Y)-{H{Y)-H{Y\S)) (8) 
^H{Y\S), (9) 
which, when optimized over Px|5> proves the direct part. 
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To prove the converse, we first note that the capacity C is upper-bounded by the capacity 
Cboth corresponding to the case where the state sequence is known to both the transmitter and 

the receiver, a capacity which is given by 



Cboth = max 1-5) (10) 

Px\s 

^m&-KH{Y\S) - H{Y\X,S) (11) 



Px\s 

= maxi/(F|5). (12) 

Px\s 

This establishes (4). And from (4) follows (5) by noting that entropy is maximized by the uniform 
distribution. ■ 
The second special case we consider is the fully deterministic broadcast channel, where the 
symbols received at both receiving nodes are deterministic functions of the channel input and 
the channel state: 

Y^h{x,s) (13a) 
Z^h{x,s). (13b) 

For this channel we have the following: 

Theorem 3: The capacity region of the channel (13) when the state sequence is drawn UD 
Ps and is known noncausally to the transmitter is the convex closure of the union of rate-pairs 

i?2) satisfying 

i?i < H{Y\S) (14a) 
i?2 < H{Z\S) (14b) 
Ri + R2<H{Y,Z\S) (14c) 

where the union is over all choices of Px\s^ and where the conditional entropies are computed 
with respect to the joint distribution PxYzsi^i Hi s) = Ps[,s)Px\s{^'^\s)'^\y = = 
/2(a;, s)}. Moreover, this is also the capacity region when the state sequence is also revealed to 
both receivers. 



Proof sketch. The direct part is proved like the direct part of Theorem 1 and is omitted. But the 
converse is greatly simplified: Let the state sequence be known to the transmitter and to both 
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receivers. For the sum rate, we have 

n{Ri + R2) = H{Mi,M2) (15) 

< /(Ml, M2; y^ Z", S^) + nsn (16) 

= /(Mi,M2;F",Z"|5")+nen (17) 

n 

= J] /(Ml, M2; y,, Z^-^) + nen (18) 



1=1 



= Y,H{Y„Z,\S^,Y'-\Z'-')+ner, (19) 

i=l 

n 

<5^//(y„Z,|5,) + ne„, (20) 
1=1 

where e„ is a function of n which decays to zero as n goes to infinity. Here, (16) follows from 
Fano's Inequality; (17) because 5"" is independent of (Mi,M2); (18) from the chain rule; (19) 
by expanding mutual information and recalling that the outputs are deterministic functions of the 

state and input; and (20) because conditioning cannot increase entropy. By similarly bounding 
Ri and R2 and letting n tend to infinity, we obtain the converse part of Theorem 3. □ 

111. Direct Part 

In this section we prove the direct part of Theorem 1. One way to do this is to use [4, Theo- 
rem 1] with a judicious choice of the auxiliary random variables, as we propose in Section EI- A. 
For completeness and simplicity, we also provide a self-contained proof in Section III-B. 

A. Proof based on [4] 

It was shown in [4, Theorem 1] that the capacity region of any (not necessarily semideter- 
ministic) state-dependent broadcast channel with noncausal side-information at the transmitter 
contains the convex closure of the union of rate-pairs (/?i, /?2) satisfying 

Ri < liUo, C/i; Y) - /(C/o, Ui, S) (21a) 

R2 < I{Uo, U2; Z) - I{Uo, U2, S) (21b) 

R1 + R2 < -[max{/([/o;y),/(C/o;Z)}-/(C/o;>5)] + 

+ I{Uo,U,;Y)-I{Uo,U,;S) 

+ I{Uo, U2; Z) - I{Uo, U2; S) - I{Ui; U2\Uo, S), (21c) 
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where the union is over all joint distributions of {X, Y, Z, S, Uq, Ui, U2) whose marginal is P5; 
that satisfy the Markov condition 

(C/o, C/i, t^2)^(X, S)^{Y, Z); (22) 

and under which the conditional law of [Y. Z) given (X, S) is that of the given channel. 
For the semideterministic channel, we choose the auxiliary random variables in (21) as follows: 

C/q = (deterministic) (23a) 
Ui^Y (23b) 
U2 = U. (23c) 

Note that the Markov condition (22) is satisfied because y is a deterministic function of {X, S) 
and because in Theorem 1 we restrict U to be such that U—o—[X, 5)— o— (Y, Z). With this choice 
of Uq, Ui, and U2, (21) reduces to (2). 

B. Self-Contained Proof 

We next provide a self-contained proof of the direct part of Theorem 1. Like [4, Theorem 
1], our proof is based on Marton's inner bound for general broadcast channels [5], [12] and on 
Gel'fand-Pinsker coding [6]. 

First note that the joint distribution (3) can also be written as 

PxYzsu{x,y,z,s,u) = Ps{s)PYu\s{y,u\s)Px\YSu{x\y, s,u)W{z\x, s) (24) 

with the additional requirement that 

y — f{x, s) with probability one. (25) 

Further note that, when Pysu is fixed, all the terms on the RHS of (2) are fixed, except for 
I{U; Z), which is convex in Px\ys when Pysu is held fixed. Since I{U; Z) only appears with 
a positive sign on the RHS of (2), it follows that the union over all joint distributions of the 
form (2) can be replaced by a union only over those where x is a deterministic function of 
(y, u, s), i.e., of the form 

PxYzsu{x,y,z,s,u) = Ps{s)PYu\s{y,u\s)l{x = g{y,u,s)}W{z\x,s) (26) 
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for some g : {y, u,s)h^x (and subject to (25)). We shall thus only establish the achievability of 
rate pairs that satisfy (2) for some distribution of the form (26). 

Choose a stochastic kernel Pyu\s ^ mapping g: {y, u,s) ^-^ x which, combined with Ps 
and the channel law, determines the joint distribution (26) for which (25) is satisfied. For a given 
block-length n, we generate a random code as follows: 

Codebook: Generate 2"^i t/-bins, each containing 2"^^ y-tuples where the h-th y-tuple in 
the mi-th bin 

y{m,,h), mie{l,...,2"«^}, /i e {1, . . . , 2"^^} 

is generated IID according to Py (the F-marginal of (26)) independently of the other y- 
tuples. Additionally, generate 2"^^ ^x-bins, each containing 2"^^ it-tuples, where the /2-th 
w-tuple in the 7712 -th u-hin 

u(m2, ^2), m2 e {1, . . . , 2"^^}, ^2 e {1, . . . , 2"^^} 

is drawn IID according to Pu (the U -marginal of (26)) independently of the other w-tuples 
and of the ^/-tuples. 

Encoder: To send message mi e {!,..., 2^^} to Receiver 1 and message m2 e {!,..., 2^^} 
to Receiver 2, look for a y-tuple y(mi, h) in y-bin mi and a it-tuple u(m2, ^2) in u-hin 7712 
such that (y(mi, Zi), u(m2, ^2)) is jointly typical with the state sequence s. If such a pair 
can be found, send 

X = 5(y(mi,/i),u(m2,/2),s), (27) 

where in the above ^'(y, u, s) denotes the application of the function g{y^ u, s) componen- 
twise. In this case the sequence received by the deterministic receiver will be y(mi,/i). 
Otherwise send an arbitrary codeword. 

Decoder 1: Try to find the unique y-hm, say m'^, that contains the received sequence y and 
output its number m'^. If there is more than one such bin, declare an error. 

Decoder 2: Try to find the unique it-tuple u(m2, ^2) that is jointly typical with the received 
sequence z and output its bin number m'2. If more than one or no such u can be found, 
declare an error. 

We next analyze the error probability of the above coding scheme. There are three types of 
errors: 
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Decoder 1 errs. This happens only if there is more than one bin that contains the received 
y. This probabiUty tends to zero as n tends to infinity provided 

Ri + Ri<H(Y). (28) 

Decoder 2 errs. This happens if either the ?i-tuple u(m2, 12) is not jointly typical with the 
received 2;-tuple, or if a different ti-tuple happens to be jointly typical with the received 
z-tuple. The probability of the former case tends to zero as n tends to infinity by the Markov 
Lemma [2]. The probability of the latter case tends to zero as n tends to infinity provided 

R2 + R2<I{U;Z). (29) 

Encoder errs. This happens only if there is no (l^h) e {1, • • • , 2"^i} x {!,..., 2"^2-}. 
such that (y(mi, Zi), u(m2, /2)) is jointly typical with the state sequence s. To bound this 
error probability, we generalize the Mutual Covering Lemma (see [12]). Without loss of 
generality, assume that the messages to be transmitted are both 1. Define 

Ai,i2 - 1 {(y(l> h),u{l, I2)) is jointly typical with s} (30) 

and 

The Encoder errs if, and only if, 

A = 0. (32) 

By Chebyshev's Inequality 

Pr[A = 0] < Pr [|A - E[A] \ > E[A]] < (33) 

To bound E.[A], note that for any li and I2 the vectors y(l, li), u(l, I2) and s are generated 
independently according to their marginal distributions. Hence, for large enough n, the 
probability that they are jointly typical is lower-bounded by 

Pr [Ai^ Z2 = 1] > 2-"(^('^)+^^'^)+-^(^)-^(^'^'^)+^') (34) 
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SO 



E[A] = 5]EK,,,] (35) 

= 5^PrK,,, = l] (36) 

^ 2n{Ri+R2-H{Y)-HiU)-H{S)+H{Y,U,S)-4e) ^^7) 

To bound Var(A) we first write it as 

Var(yl) = Var(A,,J + J2 Gov [A,,,,, A,,,,,] 

+ J2 Cov[Ai,j„ A^,^] + Cov[A,;„ , (38) 

and then bound the four terms on the RHS of (38) separately. For the first term, note that 

Var(A,,,,J = E[4J-E[A,,,J^ (39) 

= Pr[A,,;, = l], (41) 



so 



J]Var(A,„,J<^Pr[A,,,,, = l] (42) 

^E[A]. (43) 
For the second term on the RHS of (38) we have 

Coy[Ai^,i„Ai>,i^] < E[Ai,,i,Ar^,i^] (44) 

= Pr[A^,,;, = = 1]. (45) 

The probability that u(l,/2) and s are jointly typical is upper-bounded by 2~'^^^^^'^^''^^\ 
Given that (u(l,/2),s) are jointly typical, the sequences y(l,/i) and y{l,l[) are inde- 
pendent, and each of them is jointly typical with (u(l,^2),s) with probability at most 

2-n{I{Y;U,S)-3e)_ HcnCC 

Coy[Ai^,i„Ai,^,^] < 2-"(^(f^=5)-3^) . (^2-^iiiy;U,s)-se)y (45^ 

^ 2-n(I{U;S)+2IiY;U,S)-9e) ^ ^4y^ 
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so 

Similarly for the third term on the RHS of (38) we have 

ll,l2 

We next analyze the fourth term on the RHS of (38). When l[ ^ l\ and 7^ h the random 
variables Ai^^i^ and are independent and hence uncorrelated. Consequently 

5^Cov[T;„,„T,i,J =0. (50) 

Summarizing (33), (37), (38), (43), (48), (49) and (50) we conclude that the probability 
that A is zero, i.e., that the encoder errs, tends to zero as n tends to infinity provided that 
e tends to zero and 

R^>1{Y;S) (51a) 
R2>I{U;S) (51b) 
Ri + R2> H{Y) + H{U) + H{S) - H{Y, U, S). (51c) 

Summarizing (28), (29) and (51) we conclude that the above coding scheme has vanishing error 
probability as n tends to infinity for all (i?i,i?2) satisfying (2). By time-sharing we further 
achieve the convex hull of all rate-pairs satisfying (2) for joint distributions of the form (26). 
This concludes the proof of the direct part of Theorem 1. 

IV. Converse Part 

Our proof of the converse part of Theorem 1 employs ideas from Nair and El Gamal's outer 
bound [13] and of Gel'fand and Pinkser's converse for the state-dependent single-user channel 
[6], but it also contains some new elements. 

We shall show that, even if the state sequence S is revealed to the deterministic receiver (which 
observes Y), any achievable rate-pair must be in the convex closure of the union of rate-pairs 
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satisfying (2). Given any code of block-length n, we first derive a bound on Ri. 

nRi = H{Mi) (52) 

</(Mi;F",5")+ne„ (53) 

= J(Mi;F"|5^) + ne„ (54) 

n 

= J2 IiMi;Yi\Y'-\ 5") + ne„ (55) 

i=l 
n 

<J2H{nY'-\S") + nen (56) 

i=l 

n 

<5]//(Fi|5,) + ^e„, (57) 

i=l 

where e„ is a function of n which decays to zero as n goes to infinity. Here, (53) follows from 
Fano's Inequality; (54) because Mi and 5"" are independent; (55) from the chain rule; (56) by 
dropping negative terms; and (57) because conditioning cannot increase entropy. 
We next bound R2 as in [6]: 

nR2 = H{M2) (58) 
</(M2;Z")+ne„ (59) 

n 

= ^/(7Vf2;Z,|Z'-i)+ne„ (60) 

i=l 

n n 

= Y,I{M2, ^r+i; Z,\ Z'-') M2, Z'-') + nen (61) 

i=l i=l 

n n 

= ^ / (M2, St,,; Zi\ Z'-') - J2 1 {Z'-'; Si \M2, St,,) + ne„ (62) 

i=l i=l 
n n 

= J] 7 (M2, St+,; Zi\ Z'-^) - J] 7 (M2, Z'-\ St+,; S^) + ne„ (63) 

1=1 i=i 

n n 

< I (M2, Z'-\ St,,; Z,)-Y,I (M2, Z'-\ St,,; S,) + ne„ (64) 

i=l i=l 
n 

= J2 ^Ci4; Zi) - I{K; Si) + ne„. (65) 

i=l 

Here, (59) follows from Fano's Inequality; (60) and (61) from the chain rule; (62) from Csiszar's 
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Identity [14] 

n n 

{C:-,l■,D^\D^-') =J2l {D^-';Q\C^_,,)- (66) 



=1 i=l 



(63) because Si and (M2, ) are independent; (64) from the chain rule and by dropping 
negative terms; and (65) by defining the auxiliary random variables 

Vi^{M2,Z'-\S^^,), ie{l,...,n}. (67) 

We next bound the sum rate i?i + R2: 

n{Ri + R2)^H{M^,M2) (68) 
= H{M2) + H{Mi\M2) (69) 
< /(M2; Z") + /(Mi; r", M2) + ne„, (70) 

where the last step follows from Fano's Inequality. Of the two mutual informations on the RHS 
of (70) we first bound /(M2; Z"): 

n 

/(M2; Z") = 5^ /(M2; Zi|Z^-^) (71) 
?=i 

n 

<J2HM2,z^-';Zi) (72) 

i=l 
n 

= 5]/(M2,z^-\5]Vi,y,!ti;zO 

1 

n 

Y,HS?+i,yr+i;Zi\M2,Z'-') (73) 



«=1 



i=l 



Y,l{M2,Z'-\S^^„Y-^,-Z,) 
1=1 

n 

- J2 1 {Z'-'; Si, Yi \M2, S^^„Y^^,) (74) 
1=1 

n 

J2HM2,Z^-\S:^„Yr^,;Z,) 

i=l 

n 

-J2l{M2,Z'-\S^_,„Yr^,;Si,Yi) 

i=l 
n 

+ J]/(M2,51Vi,F,!ti;5„rO- (75) 



1=1 
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Here, (71), (72) and (73) follow from the chain rule; (74) by applying Csiszar's Identity (66) 
between (S'",!'") and Z"; and (75) again from the chain rule. 

We next study the sum of the last term on the RHS of (75) and the second mutual information 
on the RHS of (70): 

n 

J2 1 {M2, ^r+i, y^ii, Si, + i{M,- y", 5"|M2) 
1=1 

n 

1=1 

n 

+ I (A^i; Si, Yi |M2, YJX^ ) (76) 

i=l 

n 

= ^ / (Ml, M2, 5,, F,) (77) 

1=1 

n 

= J]/(Mi,M2,51Vi,F,;i;5,,F,) 

n 

+ J] 7 {S'-^- Si, Y,\M„ M„ S:^„ Y^,) 
1=1 

n 

- 5^ / (^r+i, Ml, M2, 5'-^) (78) 

i=l 

n 

= J2 1 {Ml, M2, S'-\ St,,, Y;1,; Si, Yi) 

i=l 

n 

- J2 1 (^r+i, Si\M,, M2, S'-') (79) 

i=l 

n 

= I (Ml, M2, s'-\ 5iVi, r,!ti; 5m 

i=l 

n 

-Yl{M,, M2, S'-\ ^IVi, Y^li^ S,) (80) 

i=l 

n 

= 5^ / (Ml, M2, ^r+i, F,!ti; Yi\ Si) (81) 

i=l 

n 

= J2H{Yi\Si). (82) 

i=l 

Here, (76) and (77) follow from the chain rule; (78) by applying Csiszar's Identity between 
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F") and 5", which yields 

n n 

J2 1 (^r+i, -^^l ^1, ^2, S'-') = ^ / (5^-^ Si, |Mi, M2, ^IVi, ; (83) 

i=l i=l 

(79) from the chain rule; (80) because Si and (Mi, M2, S"*"^) are independent; (81) again from 
the chain rule; and (82) because, given (Mi, M2, 5"), the channel inputs X'^ are determined by 
the encoder, and hence are also determined, so 

i^(ri|Ml,M2,5^F,!;l) =0. (84) 

Combining (70), (75) and (82), using the definitions (67), and further defining 

r,4y-i, ze{i,...,n}, (85) 

we obtain 



n(i?i + i?2) < X! ^(^i' Zi) - E ^(^i' ^0 

n 

+ Y,H{Yi\Si)+nen. (86) 

Summarizing (57), (65) and (86) and letting n go to infinity we obtain that any achievable 
rate-pair (i?i, i?2) must be contained in the convex closure of the union of rate-pairs satisfying 

Ri<H{Y\S) (87a) 
i?2 < I{V] Z) - I{V; S) (87b) 
Ri + R2< H{Y\S) + I{V, T; Z) - I{V, T; S, Y) (87c) 

where, given (X, S), the outputs (F, Z) are drawn according to the channel law (1) independently 
of the auxiliary random variables (V,T). To prove the converse part of Theorem 1, it remains 
to replace V and T with a single auxiliary random variable. I.e., it remains to find an auxiliary 
random variable U such that 

I{V; Z) - I{V; S) < I{U; Z) - I{U; S), (88a) 
H{Y\S) + I{V,T;Z)-I{V,T;S,Y) 

< H{Y\S) + I(U; Z) - I(U; S, Y) . (88b) 
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In fact, as we shall see, either choosing U to he V will satisfy (88) or else choosing it to be 
(y,T) will satisfy (88). If we choose U — V, then (88a) is satisfied with equality, and the 
requirement (88b) becomes 

I(T;Z\V) - I(T;S,Y\V) <0. (89) 

On the other hand, if we choose U — (V,T), then (88b) is satisfied with equality, and the 
requirement (88a) becomes 

7(T; Z\V) - I(T; S\V) > 0. (90) 

It remains to show that at least one of the requirements (89) and (90) must be satisfied: if it is 
(89), then we shall choose U as V, and it if is (90), then we shall choose U as (V,T). To this 
end we note that for all random variables T, Z, V, S, Y 

I{T- Z\V) - I{T- S, Y\V) < I{T- Z\V) - I{T; S\V), (91) 

because the RHS minus the left-hand side equals I{T; Y\S,V) which is nonnegative. Therefore, 
at least one of (89) and (90) must be satisfied. We have thus shown that there must exist a 
U which satisfies both inequalities in (88), hence the bounds (87) can be relaxed to (2). This 
concludes the proof of the converse part of Theorem 1. 

Acknowledgments 

L.W. acknowledges support from the US Air Force Office of Scientific Research under Grant 
No. FA9550-1 1-1-0183 and the National Science Foundation under Grant No. CCF- 1017772. 

References 

[1] T. M. Cover and J. A. Thomas, Elements of Information Theory. John Wiley & Sons, 1991. 
[2] A. El Gamal and Y.-H. Kim, Network Information Theory. Cambridge University Press, 2011. 

[3] Y. Steinberg, "Coding for the degraded broadcast channel with random parameters, with causal and noncausal side 

information," IEEE Trans. Inform. Theory, vol. 51, no. 8, pp. 2867-2877, Aug. 2005. 
[4] Y. Steinberg and S. Shamai (Shitz), "Achievable rates for the broadcast channel with states known at the transmitter," in 

Proc. IEEE Int. Symp. Inform. Theory, Adelaide, Australia, Sept. 4-9, 2005, pp. 2184-2188. 
[5] K. Marton, "A coding theorem for the discrete memoryless broadcast channel," IEEE Trans. Inform. Theory, vol. 25, no. 3, 

pp. 306-311, May 1979. 

[6] S. I. Gel'fand and M. S. Pinsker, "Coding for channel with random parameters," Prob. Contr. and Inform. Theory, vol. 9, 
no. 1, pp. 19-31, 1980. 



DRAFT 



16 



[7] T. M. Cover, "Comments on broadcast channels," IEEE Trans. Inform. Theory, vol. 44, no. 6, pp. 2524-2530, Oct. 1998. 
[8] S. I. Gel'fand, "Capacity of one broadcast channel," Problemy Peredachi Informatsii (Problems of Inform. Transm.), vol. 13, 

no. 3, pp. 106-108, July-Sept. 1977. 
[9] K. Marten, "The capacity region of deterministic broadcast channels," Trans. Int. Symp. Inform. Theory, 1911 . 
[10] M. S. Pinsker, "Capacity of noiseless broadcast channels," Problemy Peredachi Informatsii (Problems of Inform. Transm.), 

vol. 14, no. 2, pp. 28-34, Apr.-June 1978. 
[11] S. I. Gel'fand and M. S. Pinsker, "Capacity of a broadcast channel with one deterministic component," Problemy Peredachi 

Informatsii (Problems of Inform. Transm.), vol. 16, no. 1, pp. 17-25, Jan.-Mar. 1980. 
[12] A. El Gamal and E. C. van der Meulen, "A proof of Marton's coding theorem for the discrete memoryless broadcast 

channel," IEEE Trans. Inform. Theory, vol. 27, no. 1, pp. 120-122, Jan. 1981. 
[13] C. Nair and A. El Gamal, "An outer bound to the capacity region of the broadcast channel," IEEE Trans. Inform. Theory, 

vol. 53, no. 1, pp. 50-55, Jan. 2007. 
[14] I. Csiszar and J. Korner, Information Theory: Coding Theorems for Discrete Memoryless Systems. Academic Press, 1981. 



DRAFT 



